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ON WEIGHTED [/-STATISTICS FOR STATIONARY PROCESSES 

By Tailen Hsing and Wei Biao Wu 

Texas A&M University and University of Chicago 

A weighted [/-statistic based on a random sample Xi, . . . ,X n has 
the form U n = X^i<j j< n w i-jK(Xi,Xj), where K is & fixed symmet- 
ric measurable function and the Wi are symmetric weights. A large 
class of statistics can be expressed as weighted [/-statistics or varia- 
tions thereof. This paper establishes the asymptotic normality of U„ 
when the sample observations come from a nonlinear time series and 
linear processes. 

1. Introduction. Consider the causal process 
(1) X i = F(...,e i . 1 ,e i ), 

where the £j are i.i.d. random elements. Clearly (1) is very general and 
represents a huge class of processes. In particular, it contains the linear pro- 
cess Xi = a j £ i-j > where Oj are square summable and £j has mean 
and finite variance, and many nonlinear processes (cf. Section 3) includ- 
ing the threshold AR (TAR) models [Tong (1990)], AR with conditional 
heteroscedasticity (ARCH) models [Engle (1982)], random coefficient AR 
(RCA) models [Nicholls and Quinn (1982)], and exponential AR (EAR) 
models [Haggan and Ozaki (1981)]. The main goal of this paper is to con- 
sider the asymptotic behavior of the following statistic: 

U n = H i,j( x i, x j) : = E m-jKiX^Xj), 

where K is a fixed symmetric measurable function and the Wi are symmetric 
constants. We refer to U n as a weighted [/-statistic. The class of statistics 
that can be written in this form or variations of this form is clearly huge. For 
example, if Hi 1 i 3 (xi,X2) = [G{x\) + G(x2)]/2, n~ l U n is the partial sum of 
G(Xi), . . .,G(X n ); if H ht i 2 (xi,X2) = xix 2 /(|ii - ^| = k), then (n - k^Un 
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is the sample covariance function of lag k in {Aj}; if Hj 1; i 2 = 7^ i<i)K and 
K, respectively, for some fixed function K, then U n is a (nonnormalized) El- 
and V-statistic, respectively. 

The study of asymptotic properties of the weighted or even the usual 
[/-statistics is in general not straightforward. Hoeffding's decomposition [Ho- 
effding (1961)] provides a powerful tool for understanding the large-sample 
properties of [/-statistics based on i.i.d. or even weakly dependent observa- 
tions. See Randies and Wolfe (1979), Serfling (1980) and Lee (1990). For the 
i.i.d. case, a small number of papers consider the asymptotic properties of 
weighted [/-statistics; recent references include O'Neil and Redner (1993), 
Major (1994) and Rifi and Utzet (2000). For weakly dependent processes, the 
results for U -statistics are typically developed under mixing conditions; ex- 
amples of these can be found in Yoshihara (1976), Denker and Keller (1983, 
1986) and a series of recent papers by Borovkova, Burton and Dehling (1999, 
2001, 2002). Laws of large numbers for [/-statistics of stationary and ergodic 
sequences were considered by Aaronson, Burton, Dehling, Gilat, Hill and 
Weiss (1996) and Borovkova, Burton and Dehling (1999). For long-memory 
processes, U -statistics and quadratic forms were considered by Dehling and 
Taqqu (1989, 1991), Ho and Hsing (1996), Giraitis and Taqqu (1997) and 
Giraitis, Taqqu and Terrin (1998), among others. 

Using martingale-based techniques, we prove some general results for U n 
for processes satisfying (1) in a variety of short- and long-memory situa- 
tions. Approaches based on martingales are very effective in dealing with 
asymptotic issues of stationary processes. See Woodroofe (1992), Ho and 
Hsing (1996, 1997), Wu and Mielniczuk (2002) and Wu (2003) for some re- 
cent developments, where certain open problems are dealt with. Wu and 
Woodroofe (2004) investigate approximations to sums of stationary and er- 
godic sequences by martingales. Based on such approximations, they obtain 
necessary and sufficient conditions for such sums to be asymptotically nor- 
mal from the martingale central limit theorem. No mixing conditions will 
be involved and the results obtained are often nearly optimal. 

Specifically, in Section 2, we will state two general central limit theorems 
for a stationary process Yij, where Yjj is measurable with respect to the 
cr-field generated by Ek,k < i V j , where i V j = max(i, j). An example of 
Yij is Yij = K(Xi,Xj), but the realm of possibilities goes beyond that. In 
addition to the dependence of the process Yjj, the Wi introduce another 
level of dependence in U n . The two cases of J2t^o \ w i\ < 00 an d X^o \ w i\ = 
00 correspond to short- and long-range dependence, respectively, thereby 
entailing norming sequences of different orders of magnitude. We will address 
both cases. 

In Section 3, we apply the results to nonlinear time series that are ge- 
ometric moment-contracting. These are "short-memory" processes, which 
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include a large class of processes mentioned in the beginning of this sec- 
tion, and also processes that do not satisfy any strong mixing conditions. 
In Sections 4 and 5, respectively, our general results are applied to short- 
and long-memory linear processes. In the long-memory case, we let Yy be 
the remainder of an ANOVA decomposition of K(Xi,Xj). The resulting de- 
composition of U n is similar in spirit to Hoeffding's decomposition, and the 
asymptotic distribution of U n can be determined by identifying the dominant 
term(s) of the decomposition. In Sections 3 and 4, we also compare some of 
our results with related results in Borovkova, Burton and Dehling (2001). 
The two sets of results have overlapping but somewhat different ranges of 
applicability; we explain the differences and, where they overlap, we point 
out situations where our results work more effectively. 
Detailed proofs are included in Section 6. 

2. Notation and main results. Let £i,i G Z, be i.i.d. random elements 
taking values in a general state space. Define the shift processes Zj = ( . . . , £j_i, £%) 
and, for each £>1, Z, = = (£j_^ + i, . . . ,£»), where we often suppress 
£ in Zj ^ to simplify notation. Let Yij,i,j G Z, be random variables with 
zero means and finite variances, such that Yij = Yji, Yij G o"(Zj V j) and 
(Yij,Zk) is a stationary process in the sense that the (Yi+t^+t^k+t) have 
the same finite-dimensional distributions as (Yij, Zfc) for each t G Z; similarly 
let Yij,i,j G Z, be random variables with zero means and finite variances, 
such that Yj = Yjj, %,j G c(Zj, Zj) and (Yij, Z&) is a stationary process in 
the sense that the (Yi+t j+u %k+t) have the same finite-dimensional distri- 
butions as (Yj 7,Zfc) for each t G Z. Define the projection operator 

P t £ = £(£|Zt)-£(£|Zt-i), i€Z, 
where £ is an integrable random variable. Let 

Li j = Wi—jYij and Lij = Wi—jYij. 

The two cases where the weights Wi are summable and nonsummable have 
distinct flavors, and they will be considered separately in Sections 2.1 and 2.2. 

2.1. Summable weights. In this section, we consider the asymptotic dis- 
tribution of J2i<i,j< n w i-jYi,j, where the weights Wi are absolutely summable. 
When Yij = K(X^ Xj) — EK(X^, Xj), obvious examples of this include par- 
tial sums for which Wi = J^o and fe-lag sample covariance function for which 

w i — $ik- Let denote convergence in distribution and let N(0,cr 2 ) be the 
normal distribution with mean zero and variance a 2 . 
For any integers i,j, define 

(2) 0ij = \[PoYij\\. 



4 



T. HSING AND W. B. WU 



Theorem 1. Assume that 

oo oo 

(3) ^2^2\w k \6i,i-k <oo. 

k=0i=0 

Then 

(4) -L ^ ^-N^cx 2 ) 

V l<ij<n 

for some a 2 < oo. 

Remark 1. Since 0jj = 9ji and u;^ = u>_fc, (3) is equivalent to the seem- 
ingly stronger statement 

oo oo 

(5) Y ^2\wk\0i,i-k < oo 

k=— oo i=0 

in view of 

— 1 oo oo oo 

Y Y \ W k\®i,i-k = Y \ W k\ Y 
k=~ooi=0 k=l j=0 

oo oo oo oo 

= Y \ Wk \ Y e ^~ k - Y Y \ W k\Gi,i-k- 
k=l i=k k=0i=0 

2.2. Nonsummable weights. The derivation of the main result, Theo- 
rem 3, in this section for nonsummable weights relies on Theorem 2, which 
asserts that J2i<i,j<n^i,j can be approximated by J2i<i,j<n^i,j- To consider 
the asymptotic behavior of the latter, we apply the idea of the Hoeffding 
decomposition. 

Let 

§i d = sup\\V Yij\\ 

i>\ 

and 

(6) <^:=sup||Yij - 
Define 

n 

W n {i)=Y w i~3 and W n = 



n 

Y W nd)/n 
.1=1 
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Theorem 2. A ssuTfic that lirci^ >00 1 

and 



0, liminf n ^ 00 W n /(Er=ol^l)> 



(7) 

Then 

(8) 



limsup^min(6' iii _ fc ,e) = 0. 



1 



lim limsup 

t^oo n-»oo nW£ 



l<i,j<n 



0. 



Theorem 3. Assume that Ya^=i \ w i\ = 00 Dfc=o ( n — ^) u 'fc = °( n ^n)- 
Then under the conditions of Theorem 2, 

(9) E ^-N(0, CT 2 ) 

vnW; i<i,i<n 

/or some o~ 2 < oo. 



Remark 2. The assumptions on the in Theorems 2 and 3 are very 
minor and are satisfied for every situation of practical interest. For exam- 
ple, if w n ~ C/n 13 , < 1, then those conditions hold. Note, however, that in 
Theorem 3, the second condition J2k=oi n ~ k)w% = o(nW%) cannot be de- 
rived from the first one J2i^i \ w i\ = oo. For example, let w n = 2 k whenever 
n = 2~ , k € N, and w n = otherwise. Then J2iZi \ w i \ = oo and there exists 
a constant d > such that X)fc=o( n ~~ — c-'nW 2 for all n > 4. 



The conditions (3) and (7) are closely related through 5(. The following 
is useful in verifying the conditions in certain situations. 



Proposition 4. The following hold: 

oo oo 

(10) sup]T^-fc< 2 E^ 

k i=0 i=0 

and, for any e, 

OO OO / \ 

(11) supE min (^,i-fc,e) < 4^min( sup^,e). 

k i=0 i=0 \£>i 



Proof. Let j > i > £ > 0; then Zj and Z,- are independent of Zo- Thus 
i^j is also independent of Zq and VoYi,j = 0. If z > £, j < — 1, then Zj is 
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independent of Zo and Zj is Z„i measurable. So E^jlZo] = £?[l^j|Z_i] 
and, again, VoYij = 0. Therefore, 

6 ltl - k = \\V Y t/ _ k \\ = \\V (Yw-k ~ Yi,e~k)\\ 

<\\Ytt-k-Ytj-k\\<5t, k>£>0, 

(12) 

W = \\VoY e+ kA = \\Vo(Y e+ k,e ~ Y e+k/ )\\ 
< \\Ye+k,e - Y e+k/ \\ < 5 £ , k,£>0, 
by Cauchy's inequality. Hence 

oo k— 1 oo oo 

i=0 i=0 i=0 i=0 

proving (10). To prove (11), similarly write 



^min(0 M _ fc ,e) 

i=0 

fe — 1 y \ OO y 

( 13 ) = Vminf sup ||"P ^,i-fe|| , e ) + Vminf sup||Po*i 

,_ n \e>o / ,_ n \e>o 



i=0 xt ^ u 7 i=0 

fc— 1 / \ OO y 

= Vminf sup||P o ^i,i-Jfc|| 7 <0 + Vminf sup HPo^+ImII' e 
i=o y i=o V ^ 

Now, for < i < k — 1, by the triangle inequality and (12), 

||Wi,i-fc|| < + WPo( Y i,i-k ~ Yi,i-k)\\ < $i + S £ , 

where the same bound holds for ||Po^i+jfc,i|| if * > 0. Applying this and the 
inequality min(a + b, c) < min(a,c) + min(6, c) for a,b,c> 0, (11) follows 
readily from (13). □ 

It follows from Proposition 4 that if both the \ wi\ and the are summable, 
then (3) holds; if J2iZo su Pi>i^e < °°> then (7) holds. 

3. Nonlinear time series. Let {e'j} be an i.i.d. copy of {£j}- We say 
that X n = F(Zi n ) is geometric moment- contracting if there exist a > 0, C = 
C(a) > and < r(a) < 1 such that 

E{\F(. . . ,£_i,£ ,£i, . . . ,£ n ) 

(14) 

-F(...,£'_ 1 ,e' ,e 1 ,...,e n )\ a }<Cr n (a), n E N. 

Without loss of generality, let a < 1 since otherwise we can employ the 
Holder inequality. We may view X' n := F{ . . . , e'_ 1 , e' Q , e±, . . . , e n ) as a coupled 
version of X„ . 
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Condition (14) is very mild, and is satisfied by a wide class of nonlinear 
time series. Note that geometric moment contraction does not even require 
mixing (see Example 1). An important special class of (1) is the so-called 
iterated random functions such that (14) is satisfied. Let X n be defined 
recursively by 

(15) X n = F(X n _ 1 ,e n ), 

where F(-,-) is a bivariate measurable function with the Lipschitz constant 

fit* r \F(x,e)-F(x',e)\ 

(16) L £ = sup ; < oo 

x'=£x l*^ \ 

satisfying 

(17) £(logL £ )<0. 

Then the Markov chain (15) admits a unique stationary distribution if 
E(Lg) < oo and E[\xo — F(xo,e)\ a ] < oo for some a > and xo [Diaco- 
nis and Freedman (1999)]. The same set of conditions actually also imply 
the geometric- moment contraction (14) [cf. Lemma 3 in Wu and Woodroofe 
(2000)]. The condition (17) indicates that the iterated random function (15) 
contracts on average, which is satisfied for many popular nonlinear time se- 
ries models such as TAR, RCA, ARCH and EAR under suitable conditions 
on model parameters. 

Recall that Z k = (.. . ,e k -i,e k ) and Z k ,e = (e k -e+i, . . -,e k ). Let Z' k = (.. . ,e' k _ x ,e' k ). 



Lemma 5. The geometric moment- contraction condition (14) holds if 
and only if there exist F±, F%, ■ ■ ■ , with each F# being an i-variate measurable 
function, such that, for some C < oo, 

(18) E{\F(Z k )-F e (Z k)e )\ a }<Cr e (a), I G N. 

Proof. The direction. Assume (14). Then for each £, there exists a 
realization Z' = z such that E(\X e - X' e \ a \Z' = z ) < Cr e (a). So (18) holds 
by defining i^(-) = F(zq, •), which is clearly measurable. The "<^=" direction 
follows easily from 

E(\X e - X't\ a ) = E[\F{Z k ) - F(Tl k _ t , Z M )|°] 

< E[\F{Z k ) - F t {Z k>i )\ + \F{Z' k ^,Z k/ ) - Fz(Z k/ )\] a 

< E[\F(Z k ) - F e (Z k/ )\ a ] + E[\F(Z' k _ e , Z M ) - F e (Z k/ )\ a ] 
= 2E[\F(Z k )-F l (Z k/ )\ a ), 

where we have applied the inequality \a + b\ a < \a\ a + \b\ a for < a < 1. □ 
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In Lemma 5, we can often choose zq arbitrarily in denning F(_. This can 
be illustrated by the correlation integral example in Theorem 7. 

We remark that conditions similar to (14) and (18) have appeared in the 
literature. Denker and Keller (1986) assumed that F is Lipschitz-continuous 
in the sense that there exists a p S (0, 1) for which 

(19) \F{.. .,z n -i,z n ) - F(. . .,z' n _ x ,z' n )\ < const. • p n 

if z\ = z[, . . . ,z n = z' n . For the two-sided extension, see Definition 1.3 in 
Borovkova, Burton and Dehling (2001). Comparing with our condition (14), 

(19) is stronger and it does not allow models like X n = pX n ^\ + e n , where 
|p| < 1 and the random variables e n are i.i.d. with unbounded support. 
Borovkova, Burton and Dehling (2001) proposed a weaker version of (19), 
termed r- approximation condition, which requires 

(20) d l (r):=E\X -E(Xo\e-i,...,£i)\ r ^O as oo, 

for some r > 1. To make this weaker version operational, one needs to im- 
plicitly assume J5|-Xo| < oo, which excludes the case that e n does not have 
a mean. Our formulation has the advantage that heavy-tailed distributions 
are allowed. 

As before, write X{ = F(Zi), and for a fixed choice of ^-variate function 
Fi from Lemma 5, define X{ = Xn = F^(Zj^); let Y% x ,i % = K{Xi 1 ,Xi 2 ) — 
E[K(X il ,X i2 )] andY ilji2 = K(X h , X i2 )-EK(X il , X i2 ) and recall thatL i:li j 2 = 
Wi 1 _j 2 l^ li j 2 and Li lt i 2 = Wi 1 -i 2 Yi lt i 2 . Then Theorems 1 and 3 imply (i) and 
(ii) of the following result, respectively, in view of Proposition 4. 

Theorem 6. Suppose that for each £>1, there exists an i-variate func- 
tion Fi such that (18) holds and J2iZo^Pi>i^£ < oo. 

(i) If J2 \ w i\ < 00 ; then 

n 

n ~ 1 ' 2 E m-jlK^Xj) - EKiX^X^A^a 2 ) 
for some a 2 < oo. 

(ii) Let X^il^il = 00 ■ If we a ^ so have ]immi n - ¥00 W n /(X)iLo \ w i\) > 
and J2k=o(. n ~ ^) w k = °( n W 2 ), then 

n 

(nW 2 )- 1 / 2 J2 w.^ j [K(X i ,X j ) - EKiX^X^^^a 2 ) 
for some a 2 < oo. 
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The inequality (18) implies that the distance between Xi and Xi^ decays 
exponentially fast to in t. Thus under certain continuity conditions on K, 
5i is expected to vanish sufficiently quickly. For an application of Theorem 
6, consider the correlation integral 

n 

N b = /J ^X^-X^Kb, 
tiii2=l 

which measures the number of pairs (Xi,Xj) such that their distance is 
less than b > 0. Correlation integral is of critical importance in the study of 
dynamical systems; see Wolff (1990), Serinko (1994) and Denker and Keller 
(1986) for further references. 

Theorem 7. Suppose that X n defined in (1) satisfies (14), and for some 

K>1, 

(21) sup P(x<X -X j <x + T)<Clog~ 2K T~ 1 

for allO<T< 1/2. Then [N b - E(N b )]/n 3 / 2 -^N(0,a 2 ) for some a 2 < oo. 

Proof. Let K(x, y) = l\ x -y\<b an d Wi = 1- By Lemma 5, for each t there 
exists an ^-variate measurable function i^(-) such that (18) holds. Next we 
shall verify that (18) together with (21) implies that 5i = 0(£~ K ), which is 
summable and thus completes the proof in view of (ii) of Theorem 6. To 
this end, let u = r{a) l ^ 2 °^ < 1. Then by (18) and the Markov inequality, 
U\x -x \>u*f < u- ae E\X - X \ a < Cr(a) e / 2 , where, as usual, X = X i>e . 
For any < u < 1 , observe that 

1 1 [ K (X , Xi ) - K (X , X )] l max{ | Xo _ ^ q | _ | x . _yr . | j <ue 1 1 

<P 1/2 (b-2u e < \X -Xi\ <b) + P 1/2 (b< \X -Xi\ <b + 2u e ) 
which, by (21), is bounded by 2C 1 / 2 log - "^) -1 = 0(£" K ). Since \K\ < 1, 
WiKiX^X^-KiX^XM 

< \\[K(X ,Xi) ~ K(Xo,Xi)]l m ^ {lXo _ Xo ^ lXi _ Xil)<ui \\ 

+ \\[ K ( x o,Xi) - K(X ,Xi)]l m&x{lXo ^ XolAX _ Xil) ^ ue \\ 

= o(r K ) + o[r( a y/% 

proving that dg = 0(£~ K ). □ 

Example 1. Let X n = (X n _i + e n )/2, where e n are i.i.d. Bernoulli ran- 
dom variables with success probability 1/2. Then X n admits Uniform(0, 1) 
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as a stationary distribution. This process is not strong mixing. Now we show 
that (21) is satisfied. Assume j > 1. Let U = YL\=\ E il'^ > ~ % ■ Then U is uni- 
formly distributed over {0,1/2-' ,..., (2? - 1) /2* } and Xj = X /2? + U. Hence 

(21) holds in view of 

P(X <X -Xj<X + T) 

= EP[(x + U)/(l - 2~ J ) < X < (x + t + U)/(l - 2~ j )\U} 

<r/(l-2~ j )<2r. 

The process X n is related to the doubling map Tx := 2xmod 1 in the follow- 
ing way. Let Yq be a Uniform(0, 1) random variable and define recursively 
Yi = 2Yi_i mod 1 for i > 1. Then (X\ , . . . , X n ) has the same distribution as 
(Y n ,...,Yi) and hence 7V& and Mb = Yaj=i y^-|<6 are identically dis- 
tributed. The limiting distribution of the empirical [/-process {Mb, < b < 
1} was discussed in Borovkova, Burton and Dehling (2001); see Section 6 
therein. 

Our Theorems 6 and 7 are closely related to certain results by Borovkova, 
Burton and Dehling (2001), which considered nonweighted [/-statistics for 
two-sided processes X n = F((e n+ k)kez)i where (e n )nez is stationary and 
absolutely regular (or weak Bernoulli). To make a specific comparison, we 
state here their Theorem 7, a central limit theorem. Let 

Pk = 2sup{sup{P(A|ei,. . . ,e n ) — P(A) :A is a(e n+k ,e n+k+ i, ■ ■ .)-measurable}} 

n 

be the mixing coefficients, let a k = [2 J2i^k ^fc(l)] 1 ^ 2 with c4(l) defined by 
(20), and let K(-, ■) be a bounded, symmetric function such that 

sup E{\K(X ,X k ) - K(X',X k )\l\ Xo _ x ,\< T } < 0(r) 

l<fc<oo 

with lim T ^o l ^( r ) = 0, where X' is identically distributed as Xq and X^ is 
interpreted as an independent copy of Xq. See Definitions 1.2, 1.4 and 2.12 
in Borovkova, Burton and Dehling (2001). Then the asymptotic normality 
of U n holds provided 

oo 

(22) 5> 2 (/? fc + a fc + <A(a fc ))<cx). 

k=l 

This result has a number of similarities with our Theorem 6. However, 
the two results do not imply one another. Theorem 6 assumes one-sided 
processes with i.i.d. innovations while their result allows the innovations to 
be two-sided and weakly dependent; on the other hand, Theorem 6 allows 
unbounded K, general weights Wi and process X k for which the mean is 
infinite, whereas their result requires K to be bounded, Wi = 1 and < 
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oo. Let us make a more specific comparison in the context of Theorem 7 of 
the present paper where both results are applicable. Applying the central 
limit theorem in Borovkova, Burton and Dehling (2001) to iVj, for one-sided 
processes with i.i.d. innovations satisfying (18) with a = 1 and (21), we have 
f3 n = and a n < Cp n for all n > 1, where p G (0, 1). By Example 2.2 in 
Borovkova, Burton and Dehling (2001), 

0(r)= sup P{b-T<\X -X k \<b + T)<2Clog- 2K T~\ 

l<fc<oo 

Thus condition (22) is reduced to J2kLi ^ log~ 2K (l/p fc ) < oo, namely k > 
3/2, which is stronger than the condition k > 1 imposed in Theorem 7. 

4. Short-memory linear processes. Let (a n ) n >o be square summable, let 
(£n)n>o be i.i.d. random variables with mean and finite variance and let 

oo 

(23) X n = ^2 O-iEn-i- 

i=0 

If J2i^o \ a i\ < o°] then the covariance function T(n) = E(XQX n ) is summable 
and we say that X n is short-memory. In this section, let Yij = K(Xi,Xj) — 
EK(Xi,Xj). For short-memory processes, we shall utilize the linearity struc- 
ture and provide conditions on K(-, •) such that (3) and (7) hold by comput- 
ing the quantities 8ij in (2) and Oij in (7). In Section 5, we shall discuss the 
case when X n is long-memory, which has a very different flavor. Note that 
J2e^o < oo is n °t guaranteed by Y^Lo \ a i\ < °°) an d we need more refined 
computations, which is feasible by the linearity of X n . 

Let en = aj(i <£),X n = J2Z and Y itj = K(X h Xj) - EK^Xj). 

Also define = Xij lt j 2 — a^o + di^'o, where the truncated process 

J H a i-i e v -°° ^ 31 < h < oo, 
{0, -oo < j 2 + 1 < Jl < oo. 

Define the convolutions 

Ki x ,i 2 ,j{xi,X2) = EK{x\ +Xi 1 j + l oo>%2 + Xi 2 j + i oo), 

(24) 

K iljj (x 1 ,x 2 ) = EK(xi + X ili j +1>00 ,X2), xi,X2 G R 

Let K ilt i 2tj , K iu j, X itjuj2 and X[ ji h be defined similarly to K h)i2t j, K iuj , 
Xij lt j 2 and X[ , - 2 with Zj replaced by Zj. 

PROPOSITIONS. Assume that sup i j\\K(X i ,Xj)\\ < oo and sup,^ \\K(Xi, 
Xj)\\ < oo. Further, assume that there exist hq G N and C < oo such that, 
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for all ii,i2,£ > no, 

11^1,12,0(^1!, -oo,0' -^'2,-00,0) ~~ ^ii,i2,o(^ii,-cx),0) -^12,-00,0) 

< C(\a h \ + \a i2 \) 
and 



(25) 



(26) sup 11^,0(^,-00,0, -X"*)- ^,0(^,-00,0, X k ) || <CK I . 
fe<-i 

T/ien £/je following hold. 
(i) 



< 



fCfloil + |aj|), i,j>n , 

\ C\a,i\, i > no, j < 0. 



(ii) There exists some constant C < 00 such that 

00 00 

SUp^^i-fc < C"^ SUp I <Zi|, 



fc>0 



i=0 



and for any e > 0, 

00 

sup^min(0 M _ fe ,e) < C 



k>0 



i=0 



e + ^ mini sup |a,|,e ) 
i=o ' 



Proof. Fix i±, i<i > uq. First we remark that if I < uq, then the left-hand 
sides of (25) and (26) are both equal to so that the inequalities trivially 
hold. Writing 

E [ K ii,i2,o( X 'ii,-oo,Oi X 'i 2 ,-oo,o)l Z o] = ^ii,i 2 ,-l(^u,-oo,-l,^i2,-oo,-l), 

we have by Cauchy's inequality that 
11^(^,^)11 

— l|-^il,£2,o(-Xii,— oo,0, -Xi2,— 00,0) ^Ql ,12, — 1 ,— OO, — 1 , X^,— 00, — l) II 
= llElKi^izfl^X^-^OiXiz-oofl) - -^i 1 ,j2,o(^,- 1 ,-oo,0,^ 2 ,-oo,o)l Z o]|| 
^ ll- K il,i2,o(-X'ti -00,0,^2,-00,0) ~~ ^1,12,0(^1-00,0,^2,-00,0)11- 

Similarly 11^(^,^)11 < ll^n.o^.-oo.o.**) -^i,o(A,-oo,o,^)||, which 
completes the proof of (i) in view of (25) and (26). 
The first inequality in (ii) follows simply from 

00 fc— 1 00 00 

X! Qi,i-k = X! + — C + 2 sup | a; | , 

j=0 i=0 i=0 i=n J - 1 

whereas the second inequality there can be derived similarly. □ 
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Remark 3. Note that X- x _ oc — X^^^-i = ai 1 (e' — £o)i conditions 
(25) and (26) can be interpreted as the "Lipschitz continuity" of K^^fi and 
Ki lt Q. Discontinuous functions K are allowed since these are convolutions of 
K and the distribution functions of (^^,1,001^2,1,00) and re- 
spectively. For example, if K is a bounded function and f E , the density func- 
tion of £1, satisfies / \f' £ {t)\ dt < 00, then it is easily seen that (25) and (26) 
hold. Observe that degree of smoothness of the distributions of the above 
random vectors increases with «i,Z2- Thus, by only requiring (25) and (26) 
to hold for large i\,i2, an additional dimension of flexibility is in place. 

Proposition 8 together with Theorems 1 and 3 immediately yield 

Theorem 9. Assume that J2i^=i sw Pj>i \ a j\ < 00 • Also assume thatsup i j \\K(Xi,Xj)\\ < 
00 and sup i \\K (Xj, Xj)\\ < 00 and that the regularity conditions (25) and (26) 
hold. 

(i) IfY,iZo\ w i\ < 00 > then 

n 

n ~ 1/2 E Wi-j[K{X u Xj) -EK{X u Xj)} ^N(0,a 2 ) 
for some a 2 < 00. 

(ii) Suppose that \ w i \ =00 with lim inf ^fF^", — , > and 2~^fc=o( n — 
k) w 1 = °{ n W 2 )- Assume also that lim^_>oo 5? = 0. Then 

n 

{nWlY x l 2 m-jiKiX^Xj) - EKiX^X^SN^a 2 ) 
for some a 2 < 00. 

Remark 4. Consider the special case in which a n = n~@ for n > 1 and 
[3>1 and e, are i.i.d. standard normal random variables. Then by (20), 
d n (l) ~ cin 1 / 2 "' 3 for some c\ > 0. Here j n ~ (3 n is meant as lim™-^ 7 n //3 n = 
1. So a n ~ C2n 3//4_ ^ / ' 2 and the condition (22) of Borovkova, Burton and 
Dehling (2001) necessarily requires Ylm=i n ' ' a n < 00 j or /? > 15/2. In com- 
parison, our Theorem 9 only imposes (3 > 1. 

5. Long-memory linear processes. In (23), let a, = j~^L(j)I(j > 1) for 
some j3 € (1/2, 1) and slowly varying function L. Thus, aj is regularly varying 
at 00 with index —(3. This represents a rich class of processes. In particular, it 
contains the important time series model fractional autoregressive integrated 
moving average (FARIMA) process. See Granger and Joyeux (1980). Note 
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that {X{\ is long-range dependent in the sense that the covariances are not 
summable [cf. Beran (1994)]. 

LetX: = {0}U{k=(fci,...,Jb r ):r = l,2,...,Jfei€{l,2}} and let |k| be the 
length of k (|0| =0). We assume throughout the section that integer p > 1 
satisfies 

oo 

(27) £ n-^ +1 ^ 2 \L(n)\P +1 < oo. 

n=l 

Condition (27) allows simultaneous consideration of two cases: (i) (p + 1) 
(2/3-1) > 1 and (ii) (p + l)(2/3 - 1) = 1 and Y£=i \L P+1 (n)\/n < oo. Case (i) 
has been widely studied [see Ho and Hsing (1997)], while the boundary case 
(p + l)(2/3 — 1) = 1 has been overlooked in the literature. Our approach 
allows us to investigate the boundary case for which the limiting behavior 
depends on the growth of the slowly varying function L. 

Denote by C P (R 2 ) the class of all functions g such that the partial deriva- 
tives D^g = d T g/dxi tl ■ ■ ■ dxk r exist for all k = (k±, . . . ,k r ) £ /C for which 
|k| < p. For each h,i2, let 

p r 

Y ilti2 =K(X h ,X i2 )-J2J2 D i K h,i2,-oo(0,0) EK.-* e i. 

r=0\\\= r ji>">> s=l 

and Li lt i 2 = w^-^Y^^. Let Yij,Lij be defined as Yij,Lij with d{ = ail(i < 
£) replacing a«. Let K iui2 j, K iuj , Xij uj2 , K ilii2ij , K iuj and X iijuj2 be de- 
fined as in Section 4. 
Write 

n p 

(28) U n = -kii,^ ~r~ Zn,n 

*l,j2=l r=Q 

where 

n r 
Z n ,r-=Y1 W il _ ia A-K'ii,ia,-oo(0,0) J2 W a H s -3s^js- 

|l|=r*i)i2=l ji>— >jrS=l 

Observe that Z n>r , 1 < r < p, are well-structured, and can be shown to 
follow non-central limit theorems under mild regularity conditions on the 
D\Hi lt i 2 ,-oo(0, 0). Our main results, Theorems 10 and 11, show that the nor- 
malized 12=1 ^h,i2 follows a central limit theorem under mild conditions. 
These two pieces of information will then combine to give a comprehensive 
picture of the asymptotic behavior of U n . We refer to i 2 =i ^h,i2 an d 
J2r=iZn,r, respectively, as the short- and long- memory components of U n . 
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We now state the technical conditions for our main results. In the follow- 
ing, let 

oo 

A i (k) = ^a k jl fc = 2,4, i>0. 
j=i 

(Kl) There exists no G N such that K^^p^, •) G C P (R 2 ) when ii,?2 > n-o, 
and for all k G /C with |k| < p, 

oo,o)|Z-l] 

(29) 

— OO,— 1 , 

X i2 _oo _i). 

(K2) For ii,*2 > n o, there exists C < oo such that, for all k G K, with 
|k| < p, 

\\D] l Ki 1 j 2j o(Xi u -oo, 0, -2Q 2 ,-oo,o) ~~ ^k-^ii,i 2 ,-l — OO, — 1 ) X i2 -oo ,-l) 

(30) — (VDk-f^n,j 2 , -1(^1 -00,-1,^2,-00,-1), («neo,Oi 2 £o))|| 

<C(a?+4), 
and, for |k| = p, 

H-Dk-fQi,^, o(-^ii,-oo,0, ^2,-00,0) ~~ -Dk-fQi,j 2 ,-l 0,0)- 

(31) 

<CK(2) + A 2 (2)]. 
(K3) For i\ > no, there exists C < 00 such that 

sup 11^(^,^)11 
fc<-l 

(32) " 

= sup ||iiL" il) i(Xi 1) _ 00i o,Xfe) — ^^(Xj^-oo^ijXfe)!! ^CjajJ. 
fc<-l 

Similarly, we define 

(K4) There exists no G N such that, for each y, Ki t o(-,y) G C P (R) when 
i\ > no,«2 < — 1, and, for all k < p, 

(33) E[K^'q (Xi-t -oo t o,Xi 2 ) I Z_i] = Kjf+i.oC^ti ,-<» ,-i> ^Q 2 )- 

(K5) For ii > no, «2 <■ — 1, there exists C < 00 such that, for all k < p, 

\\ K h'o ( x h,-oo,o, X i2 ) - K^, xiX^ oo l,Xi 2 ) 

(34) 

~~ (^1,-00,-1,^2), a^eoW < Ca% v 

and, for all k < p, 

(35) 114^X^,0,^) - K^\(0,X i2 )f < CA h (2). 
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(K6) 



(36) 



lim sap\\K{X 1 ,X j )-K(X 1) X j )\\=0. 



Remark 5. The conditions (Kl) and (K4) state that we can interchange 
the order of differentiation and integration. The other conditions are smooth- 
ness conditions on K^^^q and K^ q for large ii,i2- For the latter, Remarks 
3 and 4 are still relevant. These conditions are left in the form in which 
they are directly applied in the proofs. Finding sufficient conditions that are 
easy to work with in specific contexts should be straightforward. See Ho and 
Hsing (1997), Koul and Surgailis (1997) and Giraitis and Surgailis (1999). 

In the following, we consider two special cases of J2i I w % I < 00 an d J2i I w i I = 00 • 
Generalizations are possible at the expense of additional details. 

Theorem 10. Assume thatj^i^i ^ii 1 ' 13 L{i)\ < oo, E(e\) < oo and sup,,- ||iT(X ,Xj)|| < 

oo . Then under the regularity conditions (Kl)— (K3), we have n ^ 2 X/i<j 1 ,i 2 <n 
for some a 2 < oo. 

Theorem 11. Letwi = 1, and assume thatE{e\) < oo, and supjj \\K(Xi, 
Xj)\\ < oo, sup^ \\K (Xi, Xj)\\ < oo. Then under the regularity conditions 

(K1)-(K6), we have n~ 3 / 2 J2i<i 1 i 2 <n ~* N(0, cj2 ) f or some a 2 < oo. 

We conjecture that Theorems 10 and 11 can be made more general by 
dropping the restrictions on the W{. While that generality is not achieved in 
this paper, the two theorems do already cover a wide range of interesting 
results. In particular, numerous limit theorems for the partial sum in the 
context of long-memory linear process [cf. Ho and Hsing (1997)] are special 
cases. 

As explained earlier, the asymptotic distribution of U n is determined by 
one term or a combination of terms on the right-hand side of (28). The 
asymptotic behavior of U n is described by Theorems 10 and 11, while those 
for the "non-central" terms Z n ^ r are typically more straightforward but must 
be considered case by case. Let us take any l\, . . . , l r E {1,2} with 1 < r < 
p and (r±,r2) = (p,q), and consider two special cases for the purpose of 
illustration. 

First let us consider the case where \wt\ is summable. Note that under 
general conditions we expect 



K o,t, 



—oo 



{xi,x 2 ) —>G(xi,X2) :=EK(Xi + x\, X 2 + x 2 ) 
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where X±,X2 are i.i.d. that have the same distribution as X±. Hence we 
assume that the w^-Dzi,...,z r .-^i,t,-oo(0, 0) are absolutely summable in t and 

li^(^oA 1 ,...^^iA-oo(0,0) + 2^^A ll ...^^o > t,-oo(0,0)] 
?wo ° \ t=l J 

= C 6 ( — 00, 00). 

Then it is not difficult to see [cf. Surgailis (1982) and Major (1980)] that 



]T w h - i2 D hr .. t i r K ilti2 -oo(0,0) Yj II "' 

l<il,i 2 <n ji>— >j r s=l 



and 



: 12 ( 12 w h-i2 D li,.4r K h,i2,-oo(0,0)Y[a ils ^ js 

ji>--->jr \l<ii,i2<n 

n . r "I 2 



c 2 12 



jl>—>jr lt=l S=l 



■3s 



C 2 n 2-r(2P-l) L 2r {n) f f l ^(x - u s )^ dx 

Jui>-- >u r J x=0 g _j 



dui ■ ■ ■ du r 



11 



-l+r09-l/2) L -r (7l ) 



(37) 



x w il -i 2 D h ^, : i r K ilj i 2 - oo (0, 0) II^-A 

l<ii,i2<n ii>— >jVs=l 

»1 r 



\C\ 



«!>■■■>«,. 



/ ~\{x-u s )^fdx 

JX=0 3=1 



dB(ui)---dB(u r ) 



where the limit is expressed in the form of a multiple Wiener-Ito integral 
with B denoting standard Brownian motion and y+ = max(?/,0). Note that 
applying Theorem 10, the rate of i 2 =i ^1,12 m this case ^ s n ^ 2 , which is 
lower than that of £i< il)ia < n Ai,..,z r #M2,-oo(0, 0) V ; ... , ; J]J=i 1 



l il s -js £ js- 

As a second example, we consider an application in connection with The- 
orem 11 by assuming 

(38) Wi = l and Ai,...,/ r -K"o,t,-oo(0, 0) -»Cs (—00,00). 

Under this assumption, 



12 Ai,...,J r -K'ii,i a ,-oo(0,0) II" - 

l<ii,i2<n ii>->jVs=l 
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c 2 n 4-r(2/ 3 -l) L 2r (n) 



u±>--->u r 



/ / I _ u *)+^ dxi dX2 ' 

Jxi=0 JX2=0 



dui • ■ ■ du r 



and 



n 



(39) 



-2+r(/3-l/2) 



|C| 



L- r (n) x; A 1 ,..., lr if«,i a ,-oo(o,o) x n 



KiiA 2 <n 



jl>—>jr s=l 







j 


r r (xa 


>U\ >"->1t r 


Jx 1= Jx 2 =0 



(X 2 -X!)-^ 



^(x/ 3 - u s )jf dxxdx 2 



dB(ui)---dB(u r ). 



Now Theorem 11 implies that the rate of ^™ i 2 =i -^*i,«2 * s ra3//2 ? which is dom- 
inated by that of Ei<n,i 2 < n Di u ... jir K ilth ,-oU°> °)£ii>...>ir II.- i - 

In view of these examples and Theorems 10 and 11, in (28) one can 
refer to Ya^ i 2 =i -^n,^ the short-memory component, and Ef=i %n,r the long- 
memory component of U n . 

Numerous applications result from these two simple cases. The following 
are some illustrations. 

(a) Sample covariance function. Suppose 

n 

U n = Y,T{X i )T{X l+k ) 1 
i=i 

where T is some function. n~ 1 U n is an estimator of E[T(X\)T(Xi + k)]. So 
Hi u i 2 (xi,x 2 ) = Wi 1 -i 2 K(x 1 ,x 2 ), where w\ h _ i2 \ =I(\i\-i2\ =k) and K(x\, x 2 ) 
T(xi)T(x 2 ). Thus Theorem 10 applies, where the asymptotic distribution 
rests on T. Let us consider an example by assuming k > 2, T(x) = x 2 and 
E{XiX 3 l+k ) = 0,j = 1, 2. It is easy to see that 

if t = k and 

(*i,...,l r ) = (l,l) or (2,2), 
0, otherwise. 

If p E (3/4, 1), then £f =1 Z n<r . = and hence 



w t A l! ...,z,#o,t-oc(0,0) 



f 2E{X 
I 



-^(U n -EU n )^N(0,a 2 ). 



n 



If /3 E (1/2,3/4), then Ef=i ^n,r is dominated by the term 

r 

1,12,— OO (o,o) e n 

l<il,i 2 <n ji>-->jrS=l 



Js 
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n—k 

= 8E(Xf) ^2 ^2 ai-j!ai+k-j 2 £ ji £ j2- 

»=1 31>3r 

Hence (37) and the discussion leading to it give 

[8 J E(X 1 2 )]- 1 n- 1+2 ^ 1 /2) L -2 (n)([/n _ EUn) 



d 



Ul>U2 



[(x - ui)+(x - u 2 )+] 13 dx 

x=0 



dB{ux)dB{u 2 ). 



(b) U- and ^-statistics. The asymptotic distribution of statistics of the 
form 

]T K(X h ,X i2 ) or J2 K(X h ,X i2 ) 

can be considered using Theorems 10 and 11. Note that the partial-sum 
theory developed in Ho and Hsing (1997) is readily recovered here by letting 
K(x\,x 2 ) = (h{x\) + h{x 2 ))/2. We give another example here, the Wilcoxon 
one-sample statistic and the signed-rank statistic, for which the asymptotic 
distribution is not seen elsewhere. 

Let 

K il:i2 (xi,x 2 ) = I(x 1 + x 2 > 0). 

Then [n{n — l)]" 1 Xa<u^j 2 <n -^(^n > -^2) ^ s cane d the Wilcoxon one-sample 
statistic. Let us assume for simplicity that £j has a normal distribution and 
that the marginal of {X{\ is standard normal. Simple calculations give 

Ki h -oo{xi,x 2 ) = P(X h +X i2 > -(xi + x 2 )) = 1 - $( . * 1 + X2 

V v /2( 1 + Pn-i 2 , 

where p n = EX\Xi +n and <fi is the standard normal p.d.f. Hence, 

'<'S-~ (0 '°> - 7wh^ m ^ " A - 00 

With C= 1/(20F) in (38), it follows from (39) that, for each (3 £ (1/2,1), 
(40) n^ h l 2 L- x {n)(TJ n -E\J n )^^= I [ [{x - u) + ]~^ 'dxdB(u), 

V 71 " Ju£?RJx=0 

which has a normal distribution. A related statistics is the signed-rank statis- 
tic 



i=l 
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where \Pj = sign of Xj and Rf = the rank of among \X±\, . . . , \X n \. It 
can be shown [cf. Randies and Wolfe (1979)] that W n = U n (l + o p (l)) and 
hence the asymptotic distribution can be derived in exactly the same way. 

There are situations where i 2 =i^*i.*2 as wen as ^n,r>l < r < P-, an 
equal zero. Then the asymptotic distribution will be determined by the 
lowest-order non-trivial Z n ^ r . This is exemplified by certain [/-statistics with 
degenerate kernels, kernels which satisfy / K{x\,X2) dF{x\) = for all xi- 
See Dehling and Taqqu (1989, 1991) and Wand Hsing (1996). The approach 
in those references overlaps and complements the approach described here. 

6. Proofs 



Proof of Theorem 1. Let 

t+m n 
Cm(Zt) = and S n(£m) = ^2^ m (Z t ), 



m > 1. 



i—t—m 



t=l 



oo t+m 



oo m 



Then (5) implies 

oo 

YJPo{u{^t))\\<Y, E \^t-i\et,i<J2 E H\ot,t-k «*> 

t=0 t=0i=t-m t=0k=-m 

which entails S n (£ m )/y/n => N(0, a%^) for some a m < oo by Theorem 1 in 
Woodroofe (1992) [see also Lemma 5 in Wu (2003)]. 

Let LIM be limsup m _ >co limsup n _ >00 . It remains to verify that 



LIM 



1 



E L h , 

l<ii ,i2<n 



LIM 



1 



E 



L 



1*1 — Z2 1 >m, l<ii,i2<n 



0. 



(41) 

To this end, note that the projections Vt are orthogonal and hence 



E 



L 



11,12 



1*1— i2\>m, l<ii,i2<« 



(42) 



E 

f =— oo 
n 

< E 



E 



L 



11,12 



|ii — *2 1 >iti, l<ii,42<'* 



E \\VtLi u i 2 \ 

«2|>m, l<ii,i2<n 



<2 E+ E 



, t=l t=—ooJ 



E E w^tUui^ 

12=1 ii=i2+m+l 



Making use of the fact that VtLi lt i 2 = for t > i\ V 12, 



EE E 11^,2 II =E 

t=l \ 12=1 «i=i 2 +»Ti+l / t=l 



n—1 n—k 



E E WPtL i+k ,i\ 

k=m+l i=t—k 
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(43) 



E 



E ^2\ w k\Qi,i-k 

k=m+l i=0 



< n 



n— 1 n 



E E^l^v-fc 
fc=m+l i=0 



and similarly, 



E 

t=— oo 



n n 



E E WPtL h> 

12=1 ii=i2+m+l 



(44) 



o 

E 

t=— oo 



n— 1 n— k 

E E ll^t^+fc.il 
fc=m+l i=l 



2 oo 

<E 

t=0 



n— 1 n— fc 

E E l^fcl^i+fc+M+t 

fc=m+l i=l 



oo n oo 



oo ra— 1 n— k 

— C E E E l^fcl^+fc+M+i — C E EE \ w k\0i+k+t,i+t 
t=0k=m+li=l k=m+li=lt=0 



oo n oo 



oo oo 



^ C E EEKl^i-fc =(7n E Ei^i^-i-fc' 

fc=m+l i=l j=0 fc=m+lj=0 



in view of 



oo oo 



n—1 n—k 

E E l^fcl^+fc+M+t — E E ®i+k+t,i+t 

k=m+l i=l k=l i=l 



oo oo 



^ E l^fcl E^'.i- fe = : c < °°- 
fc=l i=l 

Hence (41) follows from from (3), (42)-(44). □ 

Proof of Theorem 2. By Cauchy's inequality, we have \\Po(L ilii2 - 
Li lt i 2 )\\ < {w^-i^Sg. By the triangle and Cauchy's inequalities, we also have 

WPo(Li u i 2 — Ai,i 2 )l| — C\ w h-i 2 \^h,i2- 

Thus there exists a C > such that, for all ii,i2,£, 

(45) H'Po^ii.ia -iii,i 2 )ll < Ckii-i2l min (^ii,i2 5 ^)- 

In the sequel let LIM stand for limsup^^ limsup n ^ oc , and let C stand for 
a constant which may vary from line to line. By the proof of Theorem 1, we 
have 

2 



A :=LIM- 



1 



n 



W2 



E/ (-^n,«2 ^ii,ia) 
l<ii,i2<n 



< I + II, 
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where 



i=lim— 

nW 2 ^ 



n—ln—t 



Yl E \^k\ m ^i,i-k,Si) 

k=0 i=0 



(~i oo 

h=lim— 

nW 2 ^ 

By the assumptions, we have 



n— 1 n+t 



]T y \w k \mm(6 it i-k,5£) 

k=0i=l+k+t 



I < C lim sup sup y min(^ i , ^) = Clim sup ^ min(6^_fc, J) = 



*oo fc>0 



i=0 



5^ok>o i=0 



and 



C 



oo /n— 1 n+t 



' n— 1 oo 



II<LIM— — y y liOfclmin^-fe,^) I ^^Iwfelmin^i-fe,^) 

nW n t=0 \ k=0i=1+t J \ k=Qi=1 J 

/ oo \ 2 

<Clim supVmin(4,t-fc,^) =0. 
s ^°\k>o i=0 J 

Thus A = follows. □ 

Proof of Theorem 3. The plan of the proof is to show that, for every 
fixed £> 1, 



(46) {nWlY^ J2 L n , i2 4N(0,a 2 ) 

l<ii ,i2<ra 



as n — > oo 



for some finite a 2 . It follows then from Theorem 2 that a 2 is Cauchy in 
£ and hence converges to a finite constant as ^ — ► oo. By this and another 
application of Theorem 2, we conclude that (9) holds with a 2 = lim^oo a 2 G 
(0, oo). Thus, we will focus on proving (46) for a fixed £. Observe that 



l<ii ,i2<n,\ii— «2 1 <■£ 



ofv^f^lW^ 2 ) 1 / 2 ] 

V k=0 ) 



since W n — > oo. Thus it suffices to show that 

r2\-l/2 t d 



(47) (n^ 2 )" 



E 



N(0, <r ) as n — > oo 



l<jl,«2<rt,|il— 22 1 >^ 



for some finite a 2 . Fix i±,i2 with |ii — 22! > and observe that Zj x and Zj 2 
are i.i.d. Now define ^^^(Z^) = E'fLj^jZjJ and 
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Let i\ = r\ + q\l and 12 = ^2 + where integers < r±,r2 < ^ — 1 and 
< gi 7^ g2 < <Z = L n /^J- Since R ri+qi £ :r2+Q2 i are uncorrelated for different 
pairs qi < q 2 , 



Rri+q\l,r2+q2t 

l<gi<92<9 



1<91<92<<? 



which in conjunction with Cauchy's inequality by summing over ?T,r2=0, 
...,£— 1 yields that 



l<il,i2<n,jii— 22 1 >^ 



So (47) will follow from 



- C E X! u; r 1 + g i£-r 2 -g 2 ^ 

n 

= C E <-^< CY,(n-k)w 2 k = o(nWl). 

I<ii,i2<n fc=0 



(48) {nW^y 1 ' 2 53 4, 2 (Z n )4N(0,a 2 ) 

l<H)»2<fi)|ii— ii\>t 



as n — > 00 



for some finite <t 2 , which can be easily shown to hold by the central limit 
theorem for ^-dependent processes. □ 



Proof of Theorem 10. We verify (3). By Lemmas 12(iii) and 13, we 
have, for k > 0, 

00 00 00 

E ^M-fc - CE = C Y1 ^ < 00 • 

i=fc i=k i=0 

Next for i > and j < -1, we have by (K3) that \\V K (X h Xj)\\ = 0(\ai\) 
uniformly. It is easily seen that, for the "well-structured" part K(Xi,Xj) — 
Yij, we also have \\Vo[K(Xi,Xj) — Yij]\\ = 0(\a,i\) uniformly. Hence, for 
k>0, 

fc-i fc-i 
by Karamata's theorem. Hence 

00 k— 1 00 

^2\w k \^&i,i-k<CY,\ w k\k l ~ P L{k) <oo. 

k=0 i=0 k=0 
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Proof of Theorem 11. For each i\ >i2, let 
3^i,i 2 = K(Xi i: Xi 2 ) 

il pA(t-i 2 ) r 

t=i2+l r=l t=ji>— >>>i2+ls=l 

p r 

- E £ A-Kii,i a ,-oo(0,0) II a ^ s -i s £ i, 

r=0|l|= r i 2 >ji>->irs=l 

and 3-11,12 = 3^i2,n if h < *2- We first apply Theorem 3 to show that 

1 n 

(49) £ %,^N(0,O 

11,12=1 

for some a 2 < oo. For the process {3^.j}, it follows easily from (36) that 
lim^oo^ = 0. Hence we focus on verifying the condition (7). Observe that 
for i\ >i2, Vty^^ is equal to 

pA(t-i 2 ) r 

V t K(X il , X i2 ) - E H iui2,i 2 ( X ii-°°,i2i X n) £ II Ja^j's 

r=l t=ji>-- >j r >i2+l s=l 

if Z2 < t < i\, and is equal to VtY^ j 2 if t < Z2- Hence by Lemmas 12 and 14, 
for ii > max(0, 12), 

a _ elln -pf; ^JC&i.iai »2<0<ii, 

Note that i 2 = ^ if «2 < — P- Thus 

00 fe— 1 00 

E min(^ i i_ fc , e) = ^ min(# M _ fc , e) + ^ min(^i+fc,i, e) 

i=0 i=0 i=0 



<C e + ^min^.e) ^0 ase^O, 



\ i=0 / 

since the ipi are summable by Lemma 13. Hence the condition (7) is proved 
for {3^,j} and the proof for (49) is complete. 
Next observe that 

^*il,i2 — 3^ii, £2 "I - ^^ii,i2 

P il r 

:= 3^ii, j 2 + ]£ Ri!,i 2 ,r £ £ rT a n-js e i s i 

r=l t=i2+rt=j'i>--->j' r >i2+l *=1 
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where 

r'=0|l|=r' «2>il>-- >j r ' s=l 

Hence the conclusion of the theorem follows from (49) and Lemma 15. □ 



For k 6 fC with |k| < define 

Mj^J 2 = D] i Ki lj i 2 fi(Xi lj _oo,0) ^2,-00,0) 



E E AAt-Kii,ia -oo(0,0) e n 

r=0 |l|=r 0>ji>j 2 >-->>8=l 



a. 



Lemma 12. Assume that E(e\) < 00, and (Kl) and (K2) /10W. T/ien 
i/iere exists a constant C , independent of 11,12 > n o, such that 

(i) for all\i£ fC with |k| < p — 1, 

ro<] 2 f < C[ai +ai 2+ al ||<f f + a* 1| 2 ]; 

(ii) /or aZZ k G /C smc/i £/ia£ |k| < p, 

HM^J 2 < C[A h (A) + A t2 (4) + < Hk|+1 (2) + Af;l k l +1 (2)]; 

(iii) UPo^^ll 2 < + </£), ^ere ^ = + VA m (4) + A? +1 (2)]. 
PROOF. We first prove (i). Define 

p-|k| r 

Bk(h,i 2 ) = E E AAt-Kii,i3,-oo(0,0) E 1 1 h £ j s , 

r=l |l|=r 0=j'i >32>- >3r s=l 



P"|k| 



Si,k(n,^)=E E A%ifi ll<sr »(0,0) E 



a H s -js £ jsl 



r=l |l|=r-l -l>j2>-- >jr s=2 

p-|k| r 

\^ n. n„ , zv. . (n n^ \ " IT,, 



5 2 ,k(n,«2)=E E A^2,k-Kii,i2 -00(0,0) e n°< 

r=l |l|= r -l -l>J2> - >> s=2 

Then i? k (ii,z 2 ) = aj 1 eo-Bi,k(n,^2) + ai 2 ^o-B2,k(n,«2)- Observe that, for 1 = 1 
and 2, 

||D t>k if il)iai _ 1 (X il> _ oo ,_ lj X ia) _ 0Oi _ 1 ) - B 4)k (ii,i 2 )|| = ||A4% 2+1 ||. 
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Hence by the triangle inequality, 

— OO, — 1; -^-22 , — OO, — 1 

2 

(50) < Y^\ a iM\ D L,k K h,i2 , -l(^i,-oo,-l,X 2 ,-oo,-l) - 5^(^,^)11 



1=1 

2 



El«dll^ 2+ i 



t=l 



By (29) and the triangle inequality, 

\\v mM » 



( 2 I 



< ||-Dk-f^ji,22,o(^ii, -00,0,^12, -00,0) 

-Dk-^ii,i2 , — 1 (-^"ii ,— 00, — 1 ) X«2 ,— 00,— l) 

*i >*2 > — 1 C^ii ,— 00, — 1 j -^-12,— 00,— l) j (^ii ^0 ) ^12 ^0)) || 

+ || (V-Dk -^ii,i 2 ,-i(^n,-oo, -i,Xi 2t -00,-1), (a^eo, a j 2 £ o)) - £k(H)*2)||> 

from which (i) follows in view of (30) and (50). 

To establish (ii) we will adopt a backward induction argument. First, for 

|k| = p, since M^J 2 = D k Ki lt i 2fi (X il _ oofi ,Xi 2 _ OO!0 )-D k K ilj i 2 ^ oo (0,0), (ii) 
follows from (31). Next we make the induction assumption that (ii) holds for 
all k with |k| = m > 1 and we wish to show that it holds for all |k| = m — 1. 
By (i) we have, for any k with |k| = m — 1, 

\\V M$ 2 f < CK + 4 + al ||<£> f + af 2 ||A® fa+1 || a ]. 

(k) 

Since the projections Vt are orthogonal and V%M\ X \ = for t > 1, 



l<Ln 2 = E \\r*<lf =T,m<k»+*f 

t=— 00 t=o 

/ 00 

< C [^(4) +A i2 (4)} +E«+t + «l+ t )[A 1+ t(4) +^ +t (4)] 



oo 



+ E(4+t + ^)1C( 2 ) + <7T (2)] 

t=0 / 

by the induction assumption. Now the induction is complete since a f+t x 
A l+t (4) = o[M4)] and ££0 a^~ m+1 (2) = OL4r™ +2 (2)]. 

Finally, (hi) follows readily from parts (i) and (ii) of this lemma by noting 
that "Pt li^ij and VqY^ -t,i 2 -t = VoM-^_ t i2 _ t have the same distribution. □ 
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LEMMA 13. If p and L(-) satisfy (27), then Ya^i^Pi < 00 . 

Proof. By Karamata's theorem, 
(51) A n (k)=0(na k n ) for k>2. 

So the lemma easily follows from (27) after elementary calculations. □ 



For all i\ > > 12, define 



-3s e js 



o 

t=i 2 +i 

(k) 

where VtN- [ 2 is equal to 

^t K iif 2 ,t{Xi x ,-oo,t, Xi 2 ) 

(pA(t-i 2 ))-k r 

K h,i2,i2 ,-oo,i 2 ,X i2 ) ^2 Tl a n 

r=l t=ji>—>j r >i 2 +ls=l 

if < k < p A (t — 12) and otherwise. The following lemma is very similar 
to Lemma 12. 

Lemma 14. Assume that E(e\) < 00 and conditions (K4) and (K5) 
hold. Let i\ > uq > > 12 and write p' = p A (—12)- Then there exists a con- 
stant C, independent of i\,i2, such that 

(i) for allO<k< p' - 1, 

ll^o<Ul 2 <cK + a^||ivtSUll 2 ]; 

(ii) for allO<k<p', 

\\N^f<C[A ll (A)+A(~ W+ \2)] ] 



(iii) \\V y ilM \\ 2 < C$ 1)i2 , where Cn,i 2 = K|[K| + yjA h+1 (A) + < +1 (2)]. 

Proof. We first prove (i). Fix k < p' — 1. By the triangle inequality and 
(33), 



HWV^Ij < i^oG^n -oo.C-Xia) - -^15° -00,-1, X, 



12, 



a ii £ oKi^2,-i (Xh -00,-1, X i2 , 



+ 



a H e 0-^if,^-l (^1,-00,-1,^2) 

~~ y^-^ii.ioio (^1,-00,12)^2) Il a u-i 3 e i 3 

r=l 0=ji>j 2 >— >ir>i2+ls=l 
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The first term on the right-hand side is bounded by Clw^-i^af by the 
assumption (34). The second term on the right-hand side is bounded by 



K 



(fc+i,o). 



7^(^+1,0)/ Y 



p'-k-l 



where 



~ E K ii,i2,i^ { X h-oo,i2,X i2 ) Yl a h-js £ js 
r=l -l>Ji> -->jr>«2+l s=l 

< loiJUJvggUill + KIII^W - ivggUill. 

W ilt i 2 ,k = K i ^^ +lfi (X il+ i- 007 o,Xi 2+1 ) 

-"■11+1,12 + 1,12 + 1 ^ ^11+1,-00,12 + 1! ^-12 + U 

p'-k-l 

E^(fc+l + r,0) ,y- y N 

^-U+l,i 2 +l,i2+lV^-ii+l,-oo,i2+l 1^-12+1/' 

r=l 



x E 1 1 " ' i 

0>jl>--->jr>i2+2 S=l 



Observe that 



iiww-^+L+iii 

p'-fc-l 

< E E 

t=i 2 +l r=(pA(t-i 2 ))-fe+l 



<G>; 



A' 



(fc+l+r,0) / y 

1 + 1,42 + 1,42 + 1 1^1+1,-00,12+1 



1-Xi2+1) 



x E 1 1 " ' . • i +-+ 

t=Jl>— >jV>l2+2 s=l 



Hence (i) follows. 

To establish (ii), we will adopt a backward induction argument. First, for 
k = p' , (ii) follows from (35) in view of 



jv/V) _ i<-O'> ) (v y ^ 

11,12 — -"-11,12,0^*1,-00,0)^-12/ 



^hMM i^-fa ,— oo,i2 ) -^Qa > 



Next we make the induction assumption that (ii) holds for k = m > 1 and 
we wish to show that it holds for k = m — 1 . By (i) we have 

\\VoN^f<C[4 1+ al\\N^\\ 2 ]. 
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Since the projections Vt are orthogonal, 

ii<tn 2 = E ii^^ii^'eVo^^i 



t=i 2 +l 



t=0 



< Cwl, A h (4) + £ a 2 ll+t A ll+t {A) + £ af 1+ ^ 



'-m+l 
ii+t 



(2) 



t=o 



t=0 



by the induction assumption. Now the induction is complete since J2t^=o a i+t x 



A l+t (±) = o[M4)} and E t =o«? +t <r +1 (2) = 0[Af 



(2)]- 



Finally, (hi) follows readily from parts (i) and (ii) of this lemma by noting 



that V t y iui2 and Ww-t^-t = V Q N^_ tM _ t 



have the same distribution. □ 



Lemma 15. Under the conditions of Theorem 11, 



-3/2 



E WiiA-^O. 



Proof. Recall that 



p—r 



-E E a<;^oo(o,o) E LK-^> 

r'=0|l|=r' i2>ji>-- >j r i s=l 

and hence we have for i\>i2> 0, 

UPoiWII = II^SSlI < ckiiki + V^ 2+1 (4) + Af 2 "; 1 (2)] 

~ Ca l2 (m i2 ) ( ^ r)/2 ~ i 2 - 5 L^ +1 (f 2 ) =: % , 

where <5 = (p — r + l)(/3 — 1/2) + 1/2 by Lemma 12 and (51). Projecting 
iteratively, we obtain 

n r 2 

E ^»i)*2,r E E a n-ji II a h-js £ js 

ii>i2=l ia+r<t<ti t=ii>—>jV>»2+l s =l 

2 



E 

n>ii>-->t r >l 



n tf— 1 r 



E E II a H-^ 

«l=tl 12 = 1 8=1 



E E 

ri>t\>— >t r >l t'=—oo 



n t r —X r 



E E E[ a h-ts'Pt'RiiM,r 
il=t\ i 2 =!Vt' s=l 



t r -l / n t r -l r \ 2 

<c E E E E IR-v*-* • 

n>tl>--->t r >l t'=—oo \ h=ti i 2 =lVt' s=l / 
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Now approximating summations by integrals, the last expression can be seen 
to be asymptotically equal to Cn r+5 - 2 ( r ^ +5 )(L(n)) 2 ^- r+1 ) x INT for large 
n, where 



t r ( rl T \ 2 / r t r -l \ 2 

l\—5 



int= / / / -*-)"" / (w-o 

Jl>t 1 >--->t r >0Jt'=~oo\Jx=t ls=1 J \Jy=0Vt' 

Since 

r + 5 - 2{r(3 + 5) = 4 - (p + 1)(2(3 - 1) < 3, 

where the equality holds only if L(n) — > by (27), the result will follow if 
we can show that INT < oo, which is what we will do. First, 

1 

Jx=t! g=1 

< ( fl _ i2 )-(^-i) /°° 3.-/9(3; + fr ( a + tizlt] ~ P dx 

Jo Z-oV h-hJ 



< (tl _ t2 )-(^-D TT ( t-L-Jl) f°° X -P( X + i)-/3 dx 

s=3 

Hence we have 

r r 

-2/3 



/iVT<C<f ( tl _ t2 )-2(2/3-l)-Q (tl _ ts 

Jl>tx>->t r >0 s=3 

(fl,-/)-) 2 

, \Jy=Q\/t> J 



t' = -oo \Jy=0Vt' 

Writing 

"U / rtr-l \ 2 



(T (y-t'r S dy) dt> 

\Jy=0Vt' J 



t' = -oo \Jy=0Vt' 



-1 r ft r \ / ptr-1 \ 2 



+ / / (y-tT 5 dy) dt', 

t'=-oo Jt'=-1 Jt'=0/ \Jy=0Vt' J 

it is easy to see that all three integrals are uniformly bounded since 1/2 < 
5 < 1. Also, integrating iteratively from t r to t$, we obtain 



/ i{{t,-t s y^du---dt r <c{t l -t 2 )-^ 

Jt 2 >->t r >0 e=3 
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Since r(2/3 - 1) < 1 



r 



l>tl>-->t r >0 



(t 1 -t 2 T 2 W- 1) U(ti-t s T 2 Pdt 1 ...dt r 



s=3 



[ r (t 1 -t 2 )- r ^- i) dt 1 dt 2 <oo 



Jt 1 =oJt 2 =o 



Hence we conclude that INT < oo and the proof is complete. □ 
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